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I. INTRODUCTION 



Today, there are many problems concerning systems having spacetime dimensionality less than four. 
These problems involve a wide variety of topics ranging from condensed matter physics to gravitation. 
Some systems treated within this context are, however, only effectively three or two dimensional while 
indeed living in four spacetime dimensions. As an example, one can think of electrons within a carbon 
nanotube having a single layer of carbon atoms |4[]. These electrons are under the influence of the underlying 
lattice, which is actually curved. At this point, one can ask the following questions: Does the geometry of 
the lattice affect the motion of the electrons? If it does, how? 

Such questions can be answered by using a suitable method which takes care of the geometry of the 
system. We already know how to do this; the quantum mechanical central force problem is an example. If 
we move one step further, and change the picture a little bit, more interesting things happen. Consider the 
same example, the carbon nanotube, which has, say, a cylindrical shape, and let the length of the tube be 
much greater than its thickness. Then, one can treat the electrons as if they are 'constrained' to move on 
a cylindrical surface. So, how can this picture be viewed as the description of a constrained system, which 
obeys the laws of quantum mechanics? This will be the main question that is to be addressed in out work. 

One may well ask the following: We know that in classical mechanics, such constraints only reduce 
the number of degrees of freedom, and we also know that some quantum systems are already treated as 
two or one dimensional (like a two dimensional quantum gas, for example); so why should we expect 
to obtain a different picture than we have already? Although we cannot give any experimental evidence 
in our work, it is revealed that some interesting phenomena will occur under the influence of a curved 
lattice. There are a number of interesting studies on this issue, which assert that we should expect small 
quantum mechanical effects depending on geometry, see for example |4, (]]. What is more interesting is 
that, people have obtained contradicting results 17Q from different descriptions of the same system. Since 
one can formulate a well defined physical problem, which can be realized at least in principle, there should 
be one unique correct result involving possible geometrical effects. Up to now, people tried to understand 
the differences between two methods which are most often addressed, and see in what circumstances those 
different approaches gave the same result H8l- 1 1 lfl . 



In our work, we will try to explain those two approaches and re-derive some of the results present in 



the literature (see (4j, (], llfjl). Firstly, some necessary information about geometry will be given. Then, 
following the discussion in |4{], we will analyze the first approach which can be classified as a geometri- 
cal approach. This treatment relies on the geometrical relations which are valid in the three dimensional 
Euclidean space, and begins with writing the relevant quantum mechanical equation of motion, which will 
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be the well known Schrodinger equation at the first place. It is of course possible to consider the problem 
in N dimensional space, but for the sake of simplicity we will not deal with the general case. Next, we 
will explain a quantization procedure which has first been proposed by Dirac. As one can immediately 
understand, the discussion will begin from classical mechanics, and we will try to obtain proper quantum 
mechanical relations (if there are) via Dirac 's quantization procedure. This procedure is already formulated 
for a generic number of dimensions, so our discussion will involve an N dimensional Euclidean space and 
we will try to find the relevant Schrodinger equation again. Then, we will compare the results. As the last 
point, we will try to apply the first approach to Dirac equation, which can be considered as the beginning of 
a discussion for relativistic systems involving fermions. 



II. GEOMETRICAL PRELIMINARIES 

A. Curves in E 3 

In the next chapter, we will try to expand Schrodinger equation in powers of a coordinate in E 3 , and try 
to decompose the equation into two equations; one involving dynamics along a surface on which a particle 
is supposed to be constrained, and another equation which involves the direction that is transverse to the 
surface. This task requires the knowledge of some properties of curves and extrinsic and intrinsic geome- 
tries of surfaces in Euclidean space. Furthermore, in order to interpret the results of Dirac 's quantization 
procedure, which has also been mentioned in the introduction, one needs some information about the ge- 
ometry of surfaces. For this reason, we will briefly mention some properties of curves and surfaces in E 3 
relevant to our task, and give additional information when necessary. 

One can identify curves as "...paths of a point in motion. The rectangular (Cartesian) coordinates 
(x, y, z) of the point can then be expressed as functions of a parameter u inside a certain closed interval"^: 

Xj = Xi(u), i =1,2,3, u\ < u < U2- (1) 

This representation is known as the analytic (or parametric in some texts) representation [2J]. 

For the requirements of our discussion, we will assume that the parameter u is real and the rectangular 
coordinates x, under consideration are real functions of u. 

The following integral gives the arc length of the curve between two points on it as a function of the 
parameter u 12D : 



s(u) 
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We assume here that — is never zero within the given interval. With this assumption, one may parametrize 
du 

the curve equally well with the arc length s. When this is the case, the vector t = dx/ds is the unit tangent 
vector of the curve |2[]. There are two other important vectors attached to each point on a curve. The first 
one is the principal normal vector, defined via the equation dt/ds = /cn, and the second one is the binormal 
vector defined as b = t x n. These vectors satisfy the well known Serret-Frenet formulae Ja] (in some texts, 
the formulae are included as the formulae of Frenet, see for example |20): 



dt/ds 




K 




t 


dn/ds 




—K T 


X 


n 


db/ds 




-t 




b 



where k and r are the curvature and torsion of the curve, respectively. 



B. Surfaces in E 3 



There are various ways of representing a surface in the 3 dimensional Euclidean space (E 3 ). One is the 
familiar representation in which the surface is defined via an equation like f(x, y, z) = 0, where x, y and z are 
the Cartesian coordinates and / is a scalar function of those coordinates. One can also represent a surface 
with the Cartesian coordinates x, y, z being real functions of two independent real parameters u and v, called 
the 'analytic (parametric) representation' in a certain closed interval (2J]: 



Xi = Xi(u, v), 



U\ < U < U2, 



V[ < V < V2- 



(4) 



Here, i - 1, 2, 3 stand for x, y, z, respectively. 

It is generally assumed that the Cartesian coordinates are differentiable functions of the parameters up 
to a sufficient order. In fact, for all practical purposes, one may assume here that they are differentiable to 
all orders. This way one may expand x,- around a certain point (uq, vq) M\' 



Xi(h, k) = Xj(u , v ) + h ( ^ j + k j + ^ 



d d \ 
h— + k— Xi 
au ovj 



1 

+ n\ 



h— + k— | Xj 

ou ov 



+ • ■ 



(5) 



One also requires the parameters u and v to be independent so that the derivatives are independent. For this 
reason, the coordinate transformation matrix J2|] 

d(x,y,z) 



M - 

d(u, v) 

must have rank 2. Notice that we are dealing with a general coordinate transformation here. 
There are two important objects which are now to be defined. 



(6) 
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1. First Fundamental Form 



Consider some generic point P which lies on a surface X and let X be parametrized by u and v. Let 
r = r(u,v) be the position vector of P. If we now consider the squared distance between P and some P' 
which is in the neighborhood of P, we write: 

7 2 , , ldr(u,v) dr(u,v) \ (dr(u,v) dr(u,v) \ 

ds = dr • dr = du -\ dv ■ du -\ dv 

\ du dv J \ du dv j 

= g tiv {q)dq ti dq v , /u,v = 1,2, (7) 

where we identify u,v with q l ,q 2 respectively. g^ v is the metric tensor of X, and the expression for ds 2 
is called the first fundamental form of X [2j]. Here we also use the Einstein summation convention, which 
means that when an index appears twice in an expression there is summation over it. Equation [^completely 
determines the intrinsic geometry of the surface X. 



2. Second Fundamental Form 

dr dr 

One immediately notices that linear combinations of the vectors — and — can be used for expressing 

du dv 

all vectors tangent to X at P, so they naturally form a basis for those vectors. This means, the plane tangent 
to X at P, which is the tangent plane of X at P I2D, is spanned by these vectors. When a vector perpendicular 
to X at P is added to the naturally arising basis under consideration, one obtains an alternative basis for E 3 . 
This way, at least the points in the immediate neighborhood of the surface X can be expressed using this 
basis. We are going to discuss this issue later. 

The vector perpendicular to X at P which is of unit length and parallel to the vector Vf(x, y, z) (where 
f(x, y, z) - defines X) is the unit normal vector N(m, v) of X at P. Let us now consider the change in this 
vector as it moves from P to P' along some curve on X: 

dN(u,v) - — du + — dv. (8) 

du dv 



Notice that JN(w, v) is tangent to X, just as dr(u, v) is. The second fundamental form is now defined to 



be 



dN(u, v) ■ dr(u, v) - H /JV (q)dq M dq v (9) 



[In there is an overall minus sign in the definition, which does not affect the result included in this 
work.] The equation [9]determines the extrinsic geometry of the surface X. Here, H MV is a symmetric matrix. 
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dr(u, v) dr(u, v) 

Since both dN(u, v) and dr(u, v) are tangent to X, one may express g?N(m, v) in terms of — = and 



du 



dv 



Remembering that we have identified u, v as q x , q . 



- a n 



drjq) 
dq v 



(10) 



where the two equations for the two components of 



3N(g) 
dq" 



Qi = 1,2) are known as Weingarten equations 



|2J,|5|]. In some texts in the literature, the matrix a is called Weingarten curvature matrix |4J], or the extrinsic 
curvature fl8fl. 

There are a number of nice properties of the matrix a |4|] : 



1 

M - -tr(a), 
2 

A = det(a). 

1 , , 
H^v = jia^ gAv + g^a y ), 



(11) 
(12) 



where M is the mean curvature and A is the Gaussian curvature of X. These properties will be used in the 
calculations in the following chapters. 



III. THIN LAYER METHOD 
A. Treatment of Ferrari and Cuoghi 

The name thin layer method for constraining the equation of motion of a quantum mechanical particle 
to some surface is due to Golovnev II ill . The method has been used by da Costa ||5|] for a particle under 
the influence of a constraining potential but free to move on a surface, and by Ferrari and Cuoghi J4|] for 
a particle under the influence of an external electromagnetic field, which is also constrained to move on a 
surface. Both treatments involve the relevant Schrodinger equation ignoring the spin of the particle, and the 
constraining procedure begins at the level of the Schrodinger equation, unlike Dirac's procedure, which will 
be discussed later. 

Since the treatment of Ferrari and Cuoghi (given in |^)) is more general, it is useful to summarize this 
procedure here. 

We begin by making a general coordinate transformation in E 3 , and writing the metric with our new 
coordinates in the vicinity of some surface Z. Let us assume that two of our new coordinates q l , q 1 corre- 
spond to the parameters u, v of the previous section, and q 3 be defined as the distance from Z. Assuming 
that all q M , // = 1, 2, 3 are independent, we have three coordinates which define points uniquely at least in 
the vicinity of S. Let r(q l ,q 2 ) be the position vector of a generic point P lying on S and N(q l ,q 2 ) be the 
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unit normal vector of the surface at P. Then, the position vector of some point Q lying close to £ and a 
distance q 3 away from P is R(q l ,q , q ) = r(q , q ) + q 3 N(q l , q 2 ). Then we have the following metric: 

dR dR 

G » v ~ dqT' ' dq~ v 

-(—+ 3—) (—+ j --12 

G a = 0, G 33 = 1, (13) 

where the scalar product of the vectors is taken with respect to the Euclidean metric diag{\, 1, 1). Now, 
notice that the Weingarten matrix a enters into the expression for Gjf 

_ dr dr 3 / dr k dr k dr dr \ 3 , , , dr dr 

Gii = : • : + q \ : • Oj K r + a j r ■ : + (<r) Of; V ; r ■ : 

1 dq< dqJ H \dq< J dq k dq k dqJ j W J dq k dq l 

= gij + 2q 3 Hij + (q 3 ) 2 ai k g kl a< j, i, j = 1,2, (14) 

where we used the definitions of the first and second fundamental forms and the relation between them 

(equations 171 l9l PT2l). Here one may argue that this expression for the metric tensor is at 0((q 3 ) 2 ), while the 

expression for the position vector of Q is at 0(q 3 ). However, we did not begin with the Taylor expansion 

for the position vector of Q, which is also possible. So, with our choice of the curvilinear coordinates qf, 

the expression for R is exact. One may still expand R in powers of q 3 , but the metric tensor obtained by da 

Costa |5D and Ferrari and Cuoghi J4J] will be a special case then. We will see later that beginning with the 

assumption that the third coordinate is orthogonal to our surface will bring some conditions on the terms 

appearing in the expansion. 

Now we are ready to write the Schrodinger equation. Let A(g) be the vector potential and cD(«) be 

the scalar potential corresponding to our external field. Let us define the gauge covariant derivative J4J] as 
iQ 

D v = V y — t~^ v ' wnere Q i s the charge of our particle. We also define a gauge covariant derivative for the 
time variable [Q] as D Q = d t + — O. Using the well known expression V 2l P = -^=dJ VGG^ V <VF) for the 

h Vg 

Laplacian of the function l P, where G = detiG^y), we obtain, by direct substitution, the following equation 



ihDtfV = -3- 
2m 



■^=dJ VGG^dyV) + l -^LdJ VGG^M V )T + HQhG^AydnV + G 2 G^ v A„A v f 

Vg Vg 



(15) 



First, notice that by direct calculation: 

VG = VS(l + q 3 Tr(a) + (q 3 ) 2 det(a)) , (16) 
where Tr(a) - a\ 1 and det(a) - det(ai 7 ). 
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Now, let us consider the normalization integral: 

= J d 3 qyfg~(l + q 3 Tr(a) + (q 3 ) 2 det(a)) ^(q^iq) = 1 

If we now define a new wave function xiq) such that [4J] : 

X (q) = (l + <? 3 7>(a) + (g 3 ) 2 J^(a)) 1/2 , 

the normalization integral becomes: 

J d 3 q^(l + q 3 Tr{a) + {q 3 fdet{a))v*{qmq) = J d 2 qdq 3 y[gX*(q)x(q). 

Notice the difference in the integral measure. This way, we also separate the normalization inte- 
gral into two parts, which we will now try to do for the Schrodinger equation, using the new wave 
function xiq)- Introducing an extra constraining potential Vxiq 3 ) and calculating the derivatives of 
(l + q 3 Tr(a) + (q 3 ) 2 det(a)^j ^ , we get the following equation in the limit q 3 —> |4]: 



(17) 



(18) 



(19) 



ihDvx = 7T- 
2m 



~^di{ ^g' J djx) + ^g' J Aj)x + 2iQhg"AidjX + Q 2 (g lJ AiAj + {A 3 f) X 
yg yg 



n 2 (d 3 ) 2 x + iQh(d 3 A 3 )x + 2iQfiA 3 {d 3X ) -h 2 \[ ~Tr(a) I - det(a) 



X 



+ V A (q 3 ) X , 



(20) 



noticing that 



lim (l + q 3 Tr(a) + (q 3 ) 2 det(a)) +1/2 = 1, 



q 3 ^> 



lim d 3 ((1 + q 3 Tr(a) + (q 3 ) 2 det(a)) +U2 ) = ±Tr(a)/2, 

3-» V ' 

lim d\ ((1 + q 3 Tr(a) + (q 3 ) 2 det(a)) ±1/2 ) = ±det(a), 



q 1 ^ 



lim di f(l + q 3 Tr(a) + (q 3 ) 2 det(a)) + ) = 0. 
q 3 ^ o v ' 



(21) 



-fi 2 

Ferrari and Cuoghi call the term V s - — — 

2m 



-Tr{a)\ -det{a) 



as the 'geometric potential'. In order 
to get rid of the terms that indicate the coupling of the surface coordinates and the coordinate normal to 
the surface, one asks if a gauge transformation is possible. Ferrari and Cuoghi discuss that the equation 
(2.3) is gauge invariant (see Appendix A), so one can choose a gauge in which A3 = using the function 
y = - L A 3 (q l , q 2 , z)dz M\ for the gauge transformation. Then, the equation (2.8) can be split into two 
equations, while the wave function is now written as xiq) = Xn(q 3 )Xs(q l , q 2 ) 



h 2 , , 

ihDox„ = -—(d 3 rxn + V x {q s )Xn, 
2m 



itlDvXs = 7T- 
2m 



H 2 iOfi 

-pdti yfgg lJ djXs) + y/gg? J Aj)x* + VQhgVAidjXs + Q 2 g' J AiAjXs 
yg yg 



(22) 

+ V sXs . (23) 
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Having decoupled the dynamics on the surface and in the normal direction, one may now treat the above 
equations separately to solve for the wave function x, and calculate the expectation value of any relevant 
physical quantity on the surface without referring to the 'external world', that is, using only the quantities 
denned on E being independent of the normal direction. 

B. Taylor Expanding the Position Vector 

As we noted, the squeezed coordinate q 3 is chosen to be the distance from the surface to which the 
particle is constrained in the analyses of da Costa and Ferrari and Cuoghi. Another possible approach 
would be to consider a more general case of a coordinate transformation in which two of the coordinates 
again parametrize the surface and the third one is again chosen to be orthogonal to the surface. In Euclidean 
space, the distance from the surface coincides with the third coordinate of the curvilinear set if and only if 
the coordinate curves of q 3 are straight lines. This can be understood by noticing the fact that in Euclidean 
space, the shortest path between two points is a straight line and the distance between the points is the length 
of that path. Then, when this is the case, the discussion simply reduces to the one given by da Costa and 
Ferrari and Cuoghi. One may also ask whether the coordinate q 3 corresponds to the arc length along the 
coordinate curve, or not. Let us first consider the general case, then answer this question. 

In general, assuming that q 3 is orthogonal to the surface E, one may expand the position vector of a point 
in the neighborhood of E as follows: 

R - R + (d 3 R) Q q 3 + ^(djRHq 3 ) 2 + -(^R) (<7 3 ) 3 + ■■■ , (24) 

where the subscript implies that the quantity is evaluated at q 3 = 0. Notice that the first derivative of 
the position vector with respect to q 3 is always perpendicular to E, since it measures the rate of change of 
the position vector along a coordinate curve which is by assumption orthogonal to X. This expansion still 
handles all three dimensions, so the coordinates are by definition independent from each other. This fact 
will be useful soon. We have, for the metric tensor = d M R ■ d v R, the usual definition. Now, our choice 
(or assumption) that the third coordinate is orthogonal to E, combined with the trivial fact just mentioned, 
brings the following conditions when one writes down the metric up to second order by direct calculation: 

0(q 3 ) : di(Ro) ■ R£ + <9 ( <(R')o) • (R')o - 0, (25) 
0((q 3 ) 2 ): 23,(R[,)-Ro'+^(Ro)-Ro" + ^( R o)-Ro- ' ( 26 ) 

where the primes denote differentiation with respect to q 3 . At this point, let us assume that the third co- 
ordinate is chosen to be the arc length along the coordinate curve. Then, using the Serret-Frenet formulae 
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mentioned in Chapter 1 Section 1, one has: 



R = R + t q 3 + l(m) (<7 3 ) 2 + gj (-«t + rb) (^ 3 ) 3 + ■ ■ ■ 



(27) 



and for the above condition at 0{q 3 ): 



di(R ) ■ (m) + <9/(t) • t = 



(28) 



Now, notice that the second term is zero by definition of t. So, the first term should also be zero. Now 
notice that, for nonzero m, this is impossible, since by definition n is perpendicular to t and hence no is 
tangent to E. But a vector tangent to £ cannot be perpendicular to both of the vectors spanning £ at the 
same time, which is required by the condition. The only possibility is that (/cn) = 0. This may somehow 
be satisfied locally on certain surfaces, but if we begin with a coordinate transformation which is globally 
valid (or valid at least within a finite region of space), then m = should hold globally (or within the finite 
region considered). This means, one requires the coordinate curve of q 3 to be a straight line at least within 
a relevant finite region, like the neighborhood of £ under consideration which is regarded as a finite region 
of space at the beginning (before squeezing the system to the surface). Conversely, if the coordinate curve 
of q 3 is a straight line, then q 3 = aX + b, where A is the arc length parameter (and so the distance from Z) 
and a and b are constants. This result reduces our problem to the case considered by da Costa and Ferrari 
and Cuoghi. 

Now, we may concentrate on the other possible situation, in which q 3 is not the arc length along its 
coordinate curve. The metric tensor in this case becomes: 



up to 0((q 3 ) 2 ). Notice that the 0(q 3 ) term of is the same with the one obtained in the previous section, 
since (9;Ro is tangent to £ and so <3,(Ro) • (9/(R'o) = <5KRo) • d/(to) = 3 ; (Ro) • <5/(N). But there are additional 
terms in the second order term, and also G33 ± 1. One may still handle the problem by calculating the 
contributions to the determinant of G MV , but the measure of integrations which arise in the normalization 
and in the calculations for expectation values also changes, and takes the form: 



Gij = 8i Ro + R' q 3 + ^jR'W) 2 • dj R + R'o? 3 + ^"o(q 3 f 
= di(Ro) ■ dj(R ) + q 3 [di(Ro) ■ dj(R' ) + dtfR'o) ■ dj(R )] 
+ h,q 3 ) 2 [d ; (Ro) • <3;(R" ) + W'o) ■ 3/R ) + 2<9 i (R'o) • 5/R'o)] , 

G 33 - R'o • R'o + 2</ 3 R' • R"o + (q 3 ) 2 [R"o ■ R"o + R'o • R"'o] , 



(29) 



d 3 q = dc? d 2 q yjGyi yjgt 



new 



(30) 
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Although one may still find the expansion in powers of q 3 corresponding to y/g ne w, the factor VG33 spoils 
the separation of the integral into a surface integral and an integral along the normal direction which are 
independent of each other. This means, it is not guaranteed that we still have the chance for calculating 
expectation values or cany out normalization on the surface without any reference to the 'external world'. 



C. Spin-Magnetic Field Interaction 

It would be interesting to consider the spin-magnetic field interaction within the context of the thin layer 
method. Let us consider the following interaction 

H / = --f-B-S (31) 
Line 

where e is the magnitude of the electron charge, m is the electron mass, c is the speed of light in vacuum and 
S is the 2x2 spin operator composed of the well known Pauli spin matrices. In terms of a vector potential 
A, this term can be written as: 

Hj = --^V x A • S = -^-^Q7 A v yS A (32) 
Line Line 

where e 123 = 1. In order to calculate the effects of the curvature of E, we need to calculate the connection 
coefficients in the coordinates used. Using the well known coordinate expression for the coefficients: 

^vA ~ 2^ P ^p v - a + GpA,v - G v x,p), (33) 

and the inverse of to 0(q 3 ): 

G ij = g ij - 2q 3 H ij , (34) 

we get the following expressions: 

r 3 - - 

1 v3 ~ 1 33 U ' 

T U = ~ H ij ~ k Skia l j, 
T*3 = g k Hu - q 3 H km H mi , 

rf . = f* + q 3 [g kl {H lUj + H tj j - H m ) - H k \ gli ,j + gIjJ - gij j)} , (35) 

where t k - - —g^iguj + gijj - gijj)- We calculated the connection coefficients up to 0(q 3 ) since there are 
only first derivatives present in the expression considered. Although the expression for G py is exact in the 
case analyzed by da Costa, the inverse is calculated to first order in q 3 since we will at the end take the 
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limits of everything at q 3 = 0. Using these, we have the following expressions for V x A: 

(V;A;) - (diAj) Q - ff/AjOo + Hij(A 3 ) Q , 



(V 3 A> = (d 3 Aj) Q - g kl H,j(A k ) , 

(V;A 3 ) - (djA 3 ) - g kl H,j(A k ) Q . (36) 



So, finally, we get the following expression for the interaction term: 



Hr = -^-[S 3 B 3 + e^S i(djA 3 - d 3 Aj)] , (37) 
2mc 



by putting the above terms directly into (T32I ). The result that can be derived from this expression is that spin- 
magnetic field interaction always includes coupling with external world and this is unavoidable. Otherwise, 
the interaction term will simply be zero, meaning that there is no interaction at all. 

IV. DIRAC'S QUANTIZATION PROCEDURE 

1. Overview of the Procedure 

Another method of handling geometrical constraints imposed on physical systems was discussed by 
Dirac in 1950s |l] (see also the references therein). The method involved modification of canonical quanti- 
zation procedure beginning from the Lagrangian level, and carrying the constraints properly to the Hamil- 
tonian formalism. Dirac also modified the terminology, and used the word 'constraint' in a more general 
manner. We will also follow this usage, and indicate the usual geometrical constraints explicitly where 
necessary. 

We will, in this section, briefly summarize the relevant parts of the first two chapters of |Qj, which cover 
all issues relevant to the problem discussed in the previous chapters of our work. The discussion in QjJ] 
about this issue begins with a reminder of the ideas of Lagrangian formalism and transition to Hamiltonian 
formalism. Although the motivation is to formulate some Hamiltonian formalism for a field theory, Dirac 
begins the discussion with particle dynamics. 

As is well known, the classical equations of motion for a particle moving in an N dimensional space can 
be obtained by extremizing the so-called action integral: 

S - J" L(q, q, t)dt, 

8S = 0, (38) 

where q n , (n = 1, • • • , AO are the generalized coordinates of the particle, and the over dot denotes differen- 
tiation with respect to time t. Transition to the Hamiltonian formalism begins by defining the momentum 
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variables conjugate to these coordinates as follows: 

Pn = ^- (39) 
oq n 

Usually, one assumes that the momenta are all independent from each other. Here, Dirac notes the following 
1 1|]: "We want to allow for the possibility of these momenta not being independent functions of the velocities. 
In that case, there exist certain relations connecting the momentum variables, of the type <f>(q,p) - 0." 

This statement seems to be rather implicit, since one cannot directly visualize what sort of relations can 
appear while calculating the momenta. There is one easier way of revealing these relations via inserting the 
geometrical constraints which are usually stated as f{q) = as functions of coordinates into the Lagrangian 
with the motivation to use the method of Lagrange multipliers, as used by other people in the literature 

L new = L + A s f s (q), (40) 

where f s {q) = are the functions defining the geometrical constraints and A s are the corresponding La- 
grange multipliers, which are also treated as dynamical variables in the method of Lagrange multipliers. 
We also adopt here the summation convention used in the previous chapters. One deals with this new La- 
grangian and finds possible relations among the momenta. But of course, this is not necessarily the most 
general way of using Dirac 's procedure, and as we will see, the results predicted by the procedure will 
depend on the explicit expressions of the geometrical constraints |SD. 

Having noted this fact, we may return to Dirac's discussion. Dirac calls the relations <p m (q,p) = 0, m = 
1, • • • , M as 'primary constraints of the Hamiltonian formalism' QJLJ] . Notice that they appear at the stage of 
defining the momenta. Then, one defines the Hamiltonian in the usual way: 

H = p„q n - L. (41) 

However, as is also well known, Hamiltonian defined in this way is not unique. One can equally well write 
the following Hamiltonian, simply adding some 'zeros' to the usual one |1|]: 

H*=H + c m <f> m . (42) 

As stated by Dirac, the coefficients c m can be functions of q n and p n . The crucial point is the following yj]: 
"H* is then just as good as H; our theory cannot distinguish between H and H*. The Hamiltonian is not 
uniquely determined." 

Remembering that the Hamiltonian equations of motion are 
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they now become BjJ] 

8H d(p m . dH d(p, n 

q n = o + U m , p n = ~ ~7. U m - , (44) 

opn op n dq n dq n 

as one decides to use H* instead of H. Here, u m are unknown coefficients (we might have kept the c m 's and 

so their derivatives, but here we follow Dirac). Although we will not use Hamilton's equations of motion, it 

is important to note how they are affected when certain constraints are imposed on the system. 

In order to proceed more consistently and in a way which is suitable for quantum mechanics, one needs 

to define Poisson brackets. As is well known, the Poisson bracket of a quantity with the Hamiltonian gives 

the time evolution of that quantity in the Hamiltonian formalism in classical mechanics. In the canonical 

i 

quantization scheme which is already familiar to us, classical Poisson brackets are replaced by — times 

n 

the commutator of the entities involved, and also the functions representing physical entities are replaced 
by operators. The usual definition of Poisson brackets is the following: 

dq n dp n dp n dq n 
and it satisfies the following important properties: 

[f,g] = -[g,f\, (46) 

[fi+f2,g] = \fug] + [f2,g], (47) 

U\f2,g]=fl[f2,g] + [fl,g]f2, (48) 

[/, [g, h]] + [g, [h, /]] + [h, [f, g]] = (Jacobi Identity). (49) 

Here Dirac makes an important remark. With the above definition of Poisson brackets, one cannot write 
the Poisson bracket of a quantity which may not be a function of q n and p n , but may be a function of time 
1 1|]. This is the case for the unknown coefficients u m above. In order to avoid such a situation and express the 
time evolution of any quantity in some Hamiltonian formalism on the same footing with the time evolution 
of functions of q n and p,„ Dirac suggests to leave the usual definition of Poisson brackets and take the four 
equations (l46l ). (1471) . d48l) . (l49l) stated above as the defining properties for Poisson brackets [1]. Now, notice 
that: 

dg dg 

g - -7. — 4n + o Pn = [g, H] + U m [g, <f> m ], (50) 

oq„ Op n 



for some g - g(q, p) using the equations (1441) and (I45I ). Here, Dirac asks whether one can write this equation 
as: 



g = [g,H + U m <f>m\, 



(51) 
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and gives the answer that, this is possible if we use the equations (l46l)-d49l) as the denning properties of the 
brackets. Then, we obtain: 

g = [g,H + u m (f> m ] = [g, H] + u m [g, cp m ] + [g, u m ](f> m . (52) 

Now, we remember that indeed <p m - and the last term should vanish. So it does. But why did we use this 
fact at this point rather than at the very beginning? Dirac's gives the answer and derives an important rule 
of quantizing constrained systems tlfl: "We have the constraints <j) m — 0, but we must not use one of these 
constraints before working out a Poisson bracket. ... So we take it as a rule that Poisson brackets must all 
be worked out before we make use of the constraint equations." 
As a reminder, he devises the following notation: 

<f>m * (53) 

for the equations which are to be used after all Poisson brackets are worked out, and calls such equations as 
weak equations; implying that the equations which are not of this sort are strong equations. Then, the term 
[g, u m ]<f> m vanishes, and the expressions for g agree. 

From now on, the quantity H + u m <p m will be called the total Hamiltonian tlO and will be denoted by Hj. 

At this point, it is necessary to state another important fact. The time evolution of the constraints should 
also be taken into account, and all time derivatives of the constraints should vanish to ensure consistency. 
For this reason, the equations stating that the time derivatives of the constraints vanish are called consistency 
conditions 1 An important question arises here: there can be an infinite number of consistency conditions, 
because the constraint functions can be differentiable to all orders; so which equations are the relevant 
consistency conditions to our problem? 

Dirac gives the answer to this question by classifying the equations resulting from equating the time 
derivatives of the constraint functions to zero liO : 

Case 0: An inconsistency may occur. This implies that Lagrangian equations of motion are inconsistent. 
Dirac's example is the equations resulting from the Lagrangian L = q which directly lead to 1 = 0. So, the 
relevant cases are those in which Lagrangian equations of motion are consistent. 

Case 1: Some equations may reduce to = with the help of primary constraints. 

Case 2: Some equations may reduce to equations which are independent of the coefficients u m . They 
are then equations like x — X(q> P) - 0, which means they are indeed new constraint equations giving 
new relations between the dynamical coordinates and their conjugate momenta. These are called secondary 
constraints. For consistency, one should continue calculating the time derivatives of these functions to 
ensure consistency; that is, new consistency conditions may result from these functions. 
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Case 3: The equation may reduce to one which brings conditions on u m . At this point, the procedure for 
the relevant constraint (either primary or secondary) ends. 

So, the procedure up to the present level can be summarized as follows: write down the Lagrangian, 
then define the momenta and the Hamiltonian, determine the primary constraints, construct the total Hamil- 
tonian, calculate the time derivatives of the constraints, classify them using the above classification. Case 
1 equations bring nothing new, so they are trivial in a manner. Case 2 equations imply new constraints, so 
their time derivatives should be calculated and set equal to zero until Case 3 equations are reached. 

As Dirac himself notes QJJ] , 

The secondary constraints will for many purposes be treated on the same footing as the 
primary constraints. ... They ought to be written as weak equations in the same way as 
primary constraints, as they are also equations which one must not make use of before one 
works out Poisson brackets. 

One may well ask the following question: Why do we terminate the procedure when Case 3 equations 
are reached? Dirac gives a detailed discussion, but it will be sufficient to give only the idea. Case 3 equations 
give u m as functions U m of q n and p„ (because the Case 3 system of equations should have some solution 
if the equations of motion are consistent 111]). But, after working out the Poisson brackets, one may treat 
the constraint equations as strong equations; so, the constraint terms, together with their coefficients, will 
vanish. 

Of course, there will still be some arbitrariness in the Hamiltonian, because one may add terms which 
are weakly equal to zero with arbitrary coefficients to the solutions for u m obtained from Case 3 equations. 
Those arbitrary coefficients may be functions of time, and will be totally arbitrary. Dirac takes this as the 
reflection of some arbitrariness in the mathematical framework, like gauges in electrodynamics. Another 
interpretation might be that the number of independent degrees of freedom is less than the number of 
generalized dynamical coordinates, and there is still a freedom for observer choice. 

Here we should also define the first class and second class quantities. A quantity which is a function of 
q n and p n is labeled as first class if its Poisson bracket with all constraints vanishes at least weakly; and is 
second class otherwise [1]. It is noted in the text [l|] that primary first class constraints are the generating 
functions of contact transformations, but it is not quite obvious that this fact can be made use of in the 
analysis. 

Now one is ready to define a bracket method for handling time evolution of quantities. For this, one 
should first notice that the total Hamiltonian is a first class quantity. Here, it is necessary to follow Dirac's 
reasoning directly to see why the total Hamiltonian is a first class quantity, and what this brings to us. 
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First, remember that Case 3 equations can be solved for u m and then u m are obtained as functions of 
q n and p n . Those Case 3 equations are indeed a system of non-homogeneous equations in the unknowns 
u m . But since the system is non-homogeneous, one may add the solutions of a homogeneous system of 
equations like V m [<f>j,<p m ] = 0. Here, j = l,--- ,J counts all constraints (primary and secondary). Then, 
if the independent solutions of V m [<pj,<p m ] = are V am (q,p), a - l,--- ,A, one may write u m (q,p) = 
U m (q,p) + v a (t)V am (q, p). Notice that the coefficients v a can in general be functions of time, and are the 
totally arbitrary coefficients mentioned above. Then, the total Hamiltonian becomes |lt : 

Hj = H + U m <p,„ + v a V am (p m = H' + v a (p a . (54) 

Notice that this total Hamiltonian is first class. H' is first class since we impose [<pj, H'] = for consis- 
tency. (p a = V am 4> m are first class due to involving the solutions of the homogeneous system of equations 
V m [<pj,<f> m ] = 0. 

There is one more concept to be defined. It is what Dirac calls the 'physical state' of the system. In 
classical mechanics, one solves the Lagrangian equations of motion and calculates the time evolution of 
the system. This tells one that, what the observer should actually observe is described by the solutions of 
those equations. This gives the understanding about the physical state; the 'physical state' of the system 
concerns what the observers will actually observe. The solutions of the equations may seem to be different 
quantitatively due to observer choice, but a proper transformation from one observer to another one will 
reveal that different observers describe the same dynamics. Then, in Dirac's approach, the arbitrary features 
in the formalism should somehow not affect the state of the system, although they may change the values 
of the dynamical variables. So the following question arises: Which constraints do not affect the physical 
state of the system? 

In classical mechanics, once the initial conditions of a system are completely known, all history of the 
system can be worked out. In other words, the initial physical state of the system determines the past and 
the future of the system completely. Now, consider some dynamical variable g, and let g(t = 0) = go be 
given. After a short time 8t, g evolves as : 

g{8t) = g + gSt = g + [g, H T ]St = g + 5t ([g, H'] + v a [g, <f> a ]) . (55) 

If we now change the set of v a values (making use of their arbitrariness), we will have a different g. Taking 
the difference, we get: 

Ag{6t) = 5t(v a - v' a )[g, <t>a\ = 5te a [g, fl ] . (56) 

Now, notice that <p a are formed by primary constraints, and are also first class. If the physical state remains 
unchanged, as we require, we are led to the conclusion that primary first class constraints do not affect 
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the physical state of the system. Also Dirac notes that the above equation means (f> a are generators of 
infinitesimal contact transformations, and further, conjectures that secondary first class constraints do not 



affect the 



physical state either. However, there are counterexamples of this conjecture, mentioned in the 



literature [12]. In any case, we will not deal with such examples and construct the procedure as if Dirac's 
conjecture is correct. This will amount to adding the secondary first class constraints to the Hamiltonian 
and defining the extended Hamiltonian OjJ] : 

H E = H T + V a 4a'. (57) 

Now, all independent accessible constraints are included in the Hamiltonian, and the time evolution of any 
system will be given as: 

g = [g,H E l (58) 

The last point in the construction of the theory is defining a proper bracket formalism. Then, the idea is 
simply converting the brackets into -i/h times commutators, like canonical quantization. 

In order not to deal with physically irrelevant degrees of freedom, Dirac defines a new bracket in terms 
of Poisson brackets |[l|] : 

If, gY = If, g] - If, <PiWu 0*. Sl (59) 
A« s (60) 

Here, k, I = 1, • • • , /, count all constraints. These brackets satisfy the four important properties of Poisson 
brackets (l46l)-d49l) (see Appendix B for proofs). After defining these brackets, since all Poisson brackets 
have been worked out, one can treat the constraint equations as strong equations. This new bracket is often 
called the 'Dirac bracket'. 

The issues mentioned in this section gave an outline of the relevant parts of Dirac's treatment in QJJ] . In 
the following sections, we will investigate various cases which have been discussed in the literature. 

A. Treatment of Ogawa, Fujii and Kobushkin 

In this section, we will briefly summarize the discussion in [6] and give some comments on the proce- 
dure. 

The discussion concerns an N dimensional Euclidean space and begins from the Lagrangian level as 
Dirac's procedure requires. As we mentioned in the previous section, there is some arbitrariness in express- 
ing the constraints within the formulation of the problem, and one option is to make use of the method of 
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Lagrange multipliers; this has been preferred by Ogawa et. al. The classical Lagrangian is then 



A: 



L = -x a x a - V(x) + Af(x), 



(61) 



where the particle is assumed to have unit mass for simplicity, xf (a — 1, • • • , N) are the Cartesian coor- 
dinates of E N and f(x) = is the function expressing the geometrical constraint. As usual, the overdot 

dL 

denotes time derivative. In terms of the Cartesian coordinates, we have p a = - x a as momenta conju- 

dL n 

gate to x a and p\ - — « as our primary constraint [6J]. Following Dirac's procedure, one obtains (see 
~] dA 

Appendix B) (gj: 



H = \ P aP" + V(X), 

[x",Pb]D = 5" b-n a n h , 
[p a ,Pb]o = P c (n b d c n a - n a d c n b ), 
[x a ,x b ] D = 0, 
f{x) = 0, 
P a d a f = 0, 



(62) 



(63) 



(64) 



df 



where n a = is the unit normal vector of the iV - 1 dimensional hypersurface defined by f(x) = 0. 

ox a 

Notice that we changed our notation for Dirac brackets [A,B]p, which was expressed as [A,B]* in the 
previous section. Ogawa et. al. derive the classical equations of motion from Hamilton's equations using 
Dirac brackets and show that they are the expected equations of motion given by the usual Hamiltonian 
formalism (see Appendix B for details). 

At this point, a general coordinate transformation is performed (60 : 



Bx" 

dx a = —d<f, 
oqf 



(65) 



where // = (),••• ,N - I. The authors choose the q° coordinate to be orthogonal to the hypersurface defined 
by f(x) = 0. Then, f(x) = simply becomes cf - 0, and p"d a f - x a d a f - becomes x"d a q° - q° - 0. In 
terms of the new coordinates, we have the following metric: 

dx* dx* 



8tIV ~ dqf dq- Sat ' 



(66) 



[guv] 



goo 
Sij 



(67) 
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Here, gjj coiTesponds to the first fundamental form of the hypersurface (i.e. the metric induced on the 
hypersurface). The classical equations of motion are again the expected equations. In terms of the new 
coordinates, the brackets are (see Appendix B for details): 

[cf,p v ] D =$' v -rfn v , (68) 
[^,? v b = [^,/» v b = 0. (69) 

The important result at this point is that in classical mechanics, there is no effect of the 'external world' 

n 

on the dynamics of our particle [6JJ. In the quantum mechanical treatment, the effect of this external world 
depends on how quantization is performed (6j,|8|]. In Dirac's procedure Dirac brackets are replaced by -i/h 
times commutators, and this is a fixed ingredient of the recipe. However, the ordering of operators is not 

n 

fixed [80, and one needs to impose some ordering. The choice of Ogawa et. al. is to write the product 
of any two operators (notice that this includes simple functions, coordinate transformation matrices, usual 
momentum operators etc.) in a manifestly symmetric way: 

{A,B\ = ^{AB + BA). (70) 

It is important to stress that this is hypothesized in the work of Ogawa et. al. ; different orderings may also 
be chosen. 

With this hypothesis and the above information, the Hamiltonian operator takes the following form [6J] 
(see Appendix B for details): 

h = \g- 1/4 Pig 1/2 g ij Pjg- 1/4 + Via) + ^ 00 ri rjo C7D 

with the constraints 

q° = 0, po = 0. (72) 

Here, g = det(gij). This result is obviously different from that of da Costa (and other people who treat the 
problem in the same way as da Costa). The kinetic term is not simply -ti 2 times the Laplacian defined on 
the hypersurface (i.e. g~ l ^ 2 Pig l ^ 2 g' J Pj)\ and the additional potential term is also different. The difference 
between this kinetic term (named as Laplace-Beltrami operator in |(]]) and the usual Laplacian is not a scalar 
function which may be added to the additional potential, but is another kinetic operator: 

[g~ 1 ^ 4 , Pig l ^ 2 g'-i Pjg~ 1 ^] - (-h 2 x difference of Laplace-Beltrami and Laplacian) 

= |r 1/ V / V^+w 1 W. (73) 



22 



Let us also analyze the additional potential terms further. First, notice that there is a difference in the two 
metric tensors, that of da Costa and Ogawa et.al. In order to compare the two additional terms, let us 
now consider the case in which goo = 1- Then Ogawa et.al.' s additional term becomes T' j0 T J j Q ; but the 
additional potential derived by da Costa becomes something different. Notice also that, one should consider 
the connection coefficients in the q 3 = limit of da Costa's treatment. Using the metric introduced by da 
Costa and taking the relevant limit, one obtains (see [35]): 

rjb = «*«, (74) 

Y k kQ = a k k = a k k = Tr(a), (75) 
V s (da Costa) ~ 2r k Q T k0 - Y^, (76) 

where a is the Weingarten matrix. Notice that a has previously been defined as an object in E 3 . However, 
the same definition of a applies to the general case, in which the constraint surface is an - 1 dimensional 
hypersurface, having unit normal n, of E N . In the above lines we have made use of this fact. The last line is 
valid only when one studies the problem in E 3 ; the reason for this situation is given in the following section. 
The difference between the two approaches is discussed in the literature (see for example |7, ^]), but it is 
not clear which method is the correct one in a realistic problem. 



B. Treatment of Ikegami, Nagaoka, Takagi and Tanzawa 

In this section, we will briefly discuss the idea of Ikegami et.al. explained in (80. 

The idea proposed by the authors is a simple modification of Ogawa et.al. 's treatment. They replace the 
geometrical constraint f(x) = (defining an N - 1 dimnsional hypersurface of E ) which is included in the 
Lagrangian in the previous treatment, with its time derivative set equal to zero: 

/(*) = x"d a f(x), (77) 
L = -x a x a - V(x) + Ax a d a f. (78) 

With this Lagrangian, one obtains the following Hamiltonian and brackets (see Appendix B) |^]: 

H = l -p a (5 ab - n a n b )p b + V(x), (79) 
[x a ,x h ] D = [p a ,p h ] D = 0, 

[x a ,p h ] D = 6 a b . (80) 

Notice that the kinetic part is the square of the momentum tangent to the hypersurface. When the usual 
replacement of momenta with -ifi times partial derivatives is performed, the kinetic part becomes -H 2 



23 

times the Laplacian of the hypersurface Jill (this is what Golovnev directly suggests): 

V 2 = (V - n(h ■ V)) 2 . (81) 

At this point, the authors make a further modification, and relax the coordinate transformation hypothesis of 

Ogawa et.ai; they suggest considering the most general linear combination of operators at the quantization 

step [8]. With this suggestion, one writes the following kinetic term (the authors perform the calculation in 

a somewhat different way, but our results here will coincide with theirs; see Appendix B for details): 

1 i / 

H = -p a Pa ~ -\An a p a n b p h + Bp a n a p b n b + Cp a n a n b p b 
+Dn a p a p h n b + Ep a n b p b n a + Fn a p h n b p a 

+G PaPb n a n h + Hn a n b p a p\ (82) 



where the coefficients are arbitrary complex numbers, and the term including these coefficients is defined 
as T n within the text. Since this operator is part of the Hamiltonian, that is, an observable, it should be 
Hermitian. This condition gives: 

A* = B, C* = C, D* = D, E* = F, G* = H. (83) 

Now, in order to make a comparison with Ferrari and Cuoghi's 'geometric potential', let us restrict the 
discussion to three dimensions (i.e. to £ 3 ) and write the additional potential of da Costa in the following 
way US: 

h 2 ( 1 , 1 , \ 

Vs = -T- \--;{Tr{a)f + -Tr{a 2 )\ (84) 



2 \ 4 v 2 
using the following fact (since a is a 2 x 2 matrix) 

det a = —^ei'irai ''a,- J ' , (85) 
2! 

iUrj =& f 8> f -& f 6> t , (86) 

(7> af - Tr{a 2 ) - 2det a. (87) 

In order to be able to calculate the desired additional potential, one tries to find the necessary conditions on 
the coefficients. If one also tries to obtain the Laplacian of the hypersurface in the kinetic part, one should 
try to write d82l as -{p"p a - p a n a n b ph) plus remaining terms, and try to generate da Costa's potential 
from the remaining terms. Our results concerning this task has put forward the following conditions on the 
coefficients (see Appendix B): 

2Re(A) + 2Re(G) + 2Re{E) + C + D = 1, 

A+G* - -D, Im(E + G) = 0. (88) 
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In our work, we found that Ogawa et.al.'s ordering hypothesis does not give the desired results (see Ap- 
pendix B) and this is in agreement with Ikegami et. al.'s result. The authors also assert that there should be 
some condition imposed on the operators to fix the ordering. However, they take T n as a purely quantum 
mechanical object, which puts the resulting Hamiltonian in accordance with that of da Costa's |8[]. 

Here, there is another important issue. The comparison of Ogawa et.al.'s and da Costa's results leads 
us to restrict the discussion to E 3 , because the form of the geometrical potential given by da Costa includes 
Tr(a) and det(a), while the other includes Tr(a) and Tr(a 2 ), and da Costa's expression is converted into 
(l84l by making use of (I87T ). When higher dimensions are considered, the two expressions may not be 
comparable in general. For example, in four dimensions (that is, when a is a 3 x 3 matrix): 

det a = ^((Tr a) 3 - 3Tr(a) 2 Tr a + 2Tr(a 3 )), (89) 



making use of 



deta=— e ijk e VfV a t '" aj f a k k ' , (90) 

e ljk €i> j'k' =6' i<5 j y5 h t ~ 8' j<5 j y5 k t + & j'5 J k'8 k ? 

t 6 j r S k v + S< v 6> i>8 k f - S< i>6 j V 5 k f . (91) 

These together reveal that the values of the complex coefficients used in d82l ) depend on the number of 
dimensions. 

V. DIRAC EQUATION 
A. Thin Layer Method Applied to Dirac Equation 

Let us now try to apply the thin layer method to the Dirac equation. Extremizing the following action: 

S - J d 4 xfi(iy a d a - m)ip, (92) 
one obtains Dirac equation in flat spacetime: 

{if 8 a - m).A = 0, (93) 
in natural units and Cartesian coordinates. y a are vectors under general coordinate transformations and 4x4 



matrices satisfying 11411 : 

[f, 7 h ] + = y a f + y h y = 2r } ab , (94) 
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where rj ah = diag{\, -1,-1,-1). The solution of this equation, if/ is called a four spinor and is a scalar under 
general coordinate transformations (3J]. 

When one considers an arbitrary curved spacetime, or an arbitrary orthonormal curvilinear coordinate 
system in flat spacetime, the partial derivative is replaced by some covariant derivative D^, and the action 



becomes 



31: 



S = J ip{iy a E a 11 Df, - m)ip ^Pdd 4 q, 



(95) 



where the quantity E a 11 will be defined below. 



Let us now consider part of the discussion given in 11411 . Under a general coordinate transformation to 
some set of orthonormal curvilinear frame: 

is the relevant relation. Notice that since y" are constant matrices they commute with coordinate transfor- 
mation matrix elements. One observes here that y M are coordinate dependent. Then, in some orthonormal 
frame which is not necessarily Cartesian, d94l is the relevant relation again. But then, in this general curvi- 
linear orthonormal frame, one has: 



e a = e a ^dtf, (97) 



satisfying 



ds 1 = G flv dq^dq v = e a e b r] ab , (98) 
e atl = r 1ab e b M , e a ^G llv e a v , (99) 

as the basis one forms, which are not in general integrable, that is, they are not exact differentials of any 
scalar function. Then, one has to introduce a proper connection for the covariant derivative. 

There is one more condition to be met when working with the Dirac equation. One can apply a similarity 
transformation to the y matrices, and require the equation to remain invariant under such a transformation. 
This transformation is called a spin transformation, and an equation remaining invariant under such a trans- 



formation is called a representation invariant equation 11411 . Now, let us consider the flat spacetime Dirac 
equation in general curvilinear coordinates: 



(iy% - mWq) = 0. 



(100) 
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The spinor is a coordinate scalar, as mentioned above, then the covariant derivative simply becomes a partial 
derivative. Now applying a similarity transformation to the y matrices: 

y"* = N'^N, (101) 

(102) 



>(iN~ l 'fNd ll - m)AT _ V = ^[iffa + AW" 1 ) - mji^ = 0. 



(103) 



Here N is also a coordinate scalar. One immediately observes that the form of the equation is now different, 
that is, the equation has not remained invariant. In order to overcome this problem, one introduces the so 
called spin covariant derivative with its proper connection which leaves the equation invariant under both 
general coordinate and spin transformations: 



D^iq) = if/(q) v + T^{q)<p{q), 



(104) 



where are called Fock-Ivanenko coefficients. Although this representation will not be used in the fol 



lowing parts, the above discussion followed from 11411 briefly gives the reasoning which lies under the 



requirements that one should impose on the equation. The proper covariant derivative which is used in our 
work has the following connection [3J] : 



where 



OJ M = ^0) abfl [y a ,y b ], 

co bfi = -Eb v^e y = -Eb (o^e v - V^e A ), 



ds 2 = e a ^ v dcfdq\ 

e\E b » = 5 a b , e\E a v = 6\. 



(105) 
(106) 

(107) 
(108) 



e a fj are called vierbeins (vielbein in general dimensions). 

Now, let us try to apply the thin layer method to the Dirac equation. We assume that (similar to |4j,|5|]), 
our particle is constrained to move on a surface f(x, y, z) = 0. The metric in curvilinear coordinates is then: 

1 



[Gay] = 



-dj 



(109) 



0-1 

where the spatial part of the metric is -1 times the metric introduced by Ferrari and Cuoghi up to 0(q 3 ). 
The reason is that Dirac equation has only first order derivatives, and an expansion to first order in q 3 will 
be enough. Then, we introduce the vierbeins: 

e a i = d t x a + q 3 H ijg jk d k x a , e° = 1, e a 3 - N a , others = (110) 
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where the subscript zero denotes time coordinate and N" are the Cartesian components of the surface normal 
(which is a unit vector), just as in Chapter 2. These give the inverse vierbeins: 

E a '' = J]ab(g ij ~ q 3 H i j)d j x h , E = 1, E a 3 = N a , others = 0. (Ill) 

In order to calculate the spin connection coefficients, Christoffel symbols are to be calculated. They have 
already been in service, calculated in Chapter 2, equations (|35T ). Using these and the definition of spin 
connection, one obtains: 

of do = 0, 

(jf d3 = 0, 

of di = & c di - N c H ki E d k + 0(q 3 ), (112) 

by direct calculation. Since the connection coefficients are not differentiated, one does not need the ex- 
plicit expression of 0(q 3 ) term. The over tildes indicate that the quantity is calculated using only surface 
coordinates, that is: 

Ed k - rj da g kl e a u 

oj c di - -E d k {di~e c k - f M e c ,), 

ffei = \s l} (gik,i + gu,k ~ gki,l)- (! 13 ) 



Next, one rescales iff as in Chapter 2. This has been done in another article 1 10TI . and the rescaled spinor is: 

F=l+q 3 Tra. (114) 

Notice that the rescaling factor is 0(q 3 ) this time. Now, varying the action d95l ) with respect to iff, one 
obtains the curved spacetime Dirac equation: 

(iy a E a 11 - m)iff = 0. (115) 

Putting everything into (11151) . one obtains: 

iy°doX + l -y a E a W0 C di - N c H ki £d *)[/, /k 

o 

+ iy a E a l d iX + iY'N a (d 3X - \{Tr a) X ) - m X ) - (116) 

1 >q 3 =0 

in the q 3 = limit. Notice that this equation is also separable, that is, dynamics involving 2D surface 
coordinates and the coordinate orthogonal to the 2D surface can be decoupled. But notice also that, the 
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representation of Dirac matrices is still 4 dimensional, so one may search for an alternative representation 
which is indeed written in 3 spacetime dimensions iTlofl . 

i 

Another observation one makes here is that this equation includes an additional potential term, --y"/V a 7> a 

in natural units. This potential arises from the rescaling of the 4 spinor \fj. There is one other additional 

term in the equation, coming from the connection coefficients: -y a E a l rjb c N c H kj E d k [y b ,y d ]. This term 

8 

can also be interpreted as an additional potential term, but whether this term is a real scalar or not is to be 
verified. Such a term did not appear in the treatment for Schrodinger equation. This difference arises from 
the difference between the covariant derivatives. 



VI. CONCLUSION 

In our work, we have analyzed two different approaches [4j, |6Q, that of da Costa and Dirac, and two 
valiants of Dirac's approach |8J]. Another approach, that of Golovnev, has not been analyzed but derived 



llll ). We have also discussed how 



from the so called 'conservative constraint condition' of Ikegami (see (8|, . 
Dirac's approach can be adjusted to give the same result with da Costa. The reason for doing so is that da 
Costa's approach is rather geometrical, and is independent from the quantization procedure; that is, whatever 
the wave equation is, one can expand the equation in powers of one coordinate (which essentially is small 
compared to the length scales relevant to the system) and calculate the limiting equation as this coordinate 
approaches to zero. This fact makes this approach to be closer to a correct description; whatever that 
description is. However, Dirac's procedure is algebraic, and involves the canonical formalism of classical 
mechanics and canonical quantization. Such a procedure seems to be necessary for dealing with constrained 
quantum systems, since the experimentally verified applications of quantum theory are based on certain 
quantization procedures. Dirac's work reveals some ambiguities in such a trial 1 1, ^]. As stated above, 
the geometrical approach of da Costa seems to be closer to give the correct result, due to being a direct 
expansion of the relevant Schrodinger equation; but the existence of geometrical constraints inevitably 
imply physical interactions, for it is impossible to constrain a system geometrically without making use of 
interactions; and as a result, Dirac's procedure may predict different aspects of such an interaction which 
may have not been taken into account before. 

We have also tried to calculate the spin-magnetic field interaction and connection coefficients of the 
Dirac equation using da Costa's approach, which we have not been able to find in the literature. But this 
discussion requires a more detailed analysis, which will possibly be given in the future. One should also 
apply Dirac's method to write a constrained Dirac equation, to compare the results with those of da Costa's 
approach, and possible interactions which can effectively be viewed as constraining mechanisms should be 
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considered. 

Appendix A: APPENDIX 
1. Gauge Invariance of (\5\ 

In [4J], the following well known gauge transformation is mentioned: 

iOy 

A' tl =A M + 8 M y, Aq = A + d y, if/ = ^ exp(-^), (Al) 

where the subscript zero denotes time derivative and y{t,cf) is some scalar function. Putting these new 
objects directly into (fT31) and taking the derivatives, one obtains the following: 



- ^=d M ( VGOT v (^exp(^))) + ^( VG^(A V + d y y)*Fexp(^)) 
+ 2/2^ v (A v + d v y)d^tx V { 1 ^)) + Q 1 1 \A tl + d M y)(A v + d v y)^cxp(-^) 



1 

2m 



(A2) 



where Ao - —V. Now, expanding derivatives, one obtains: 

*7z3 (¥ exp(^)) = ih(d Q V) exp(^) - itu/fQ(d y) exp(^), (A3) 
^((Aexp(^)) - exp(^)^ t A + y^yC^-A, (A4) 



VGG^ v 5 v (^exp(^-))) = k( Vg<^ v 5 v »p) + 2^( VGG" v ¥)d v y 
+ l -j <G(? v Vd,d v y - ^ VGG" Vl F^yd y y) exp(^), (A5) 
5 P ( VGG" V (A V + d v y)¥exp(^)) = e xp(^)k( VGG" V A V ^) + ^ VGG" v A vl A^ r 

+ fy( VGG^>)5 v y + VGG^V^5 v y + y VGG* 1 V^ydyyJ. (A6) 

Then, putting the above terms into (fT5l) and cancelling exponentials from both sides, one directly obtains 
(fHT ) again. 

2. Dirac Brackets and @6)-(|49) 



We already know that Poisson brackets satisfy those four important properties. Then, all Poisson bracket 
terms in the definition of Dirac brackets satisfy (l46l) . The matrix entries A*/ = [<pk, (pi] also satisfy (l46l) . Then, 
both terms present in the definition of Dirac brackets satisfy this, so Dirac brackets satisfy (l46l) . 
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For (|47T ). let us write the expression explicitly: 



U + g, hf = [f + g, h] -[f + g, $ k W k l l<t>i, h] 

= If, h] - [f, 0*]Ar/ [0 Z , h] + [g, h] - [g, <p k W k l V<Pu h] = [f, hf + [g, hf. (A7) 



[fg,h]* = [fg,h]-[fg,<f> k ]A-J[<f> l ,h] 

= [f, h]g + f[g, h] - ([f, <f> k ]g + f[g, 0,]) A7, 1 [0,, h] = [f, h]*g + f[g, hf. (A8) 



[/, ig, hf]* = [/, lg, hf] ~ [/, <PnAKl [<Pn, [g, hf] 

= [f, [g, h]] ~ [f, [g, (l>kWu [0/, h]] - [f, <p m ]A^([<f> n , [g, h]} - [<P n , [g, 0fc]A*/ [fa, ft]]) 

= If, [g, h]] - [f, [g, M]A;l [0/, h] - [f, A k l][g, fc ][0 z , h] 

- [f, [0,, h]]A£ [g, <p k ] - [f, <f> m ]A^ n [cpn, [g, h]] 

+ [f, m ]A^[0„, [g, ^]]A7/ [<f> h h] + [f, 0„JA^[0 n , Aylig, (p k Mh h] 

+ [f, m ]A^[0„, [0,, h]]A7} [g, <p k ]. (A9) 



It is obvious that the first term, which is an ordinary Poisson bracket, satisfies the Jacobi identity. Now, let 
us consider the second, fourth and fifth terms. When /, g and h are permuted cyclically, and the nine terms 
are summed, one obtains: 



[h, <p k Wu U, fe. 0/U + l<f>i, U, g\\ + fe, [0/, /]] + fe, <PkW k , Ut, If, m + [4>i, [h, /]] + [f, [0/, h]] 



Now, consider the sixth and eighth terms (with /, g, h permuted cyclically and the six terms summed): 



For (|48T ). let us again write the expression explicitly: 



For d49l ), it is useful to follow the direct proof given in the Appendix of 11311 : 




(A10) 



[/, (f> m ]A~ n A^f [h, 0,] [0„, [g, fa]] + [0*, [0„, g]] 



+ [g, <Pm] A mtl A kI [f, 0,] [0„, [h, 0fc]] + [0 fc , [0„, &]] 



m ]A m ;,A w 1 [g, 0,] [0,„ [/, fc ]] + [0 fc , [0„, /]] 



- - [/, (p m ]A mn A kl [h, (p,][g, [0 t , 0„]] - [g, m ]A m „A yt/ [/, 0/][/i, [0 fc , 0„]]- 



[/j, (f> m ]A mn A kl [g, <p,][f, [0 fc , n ]], 



(All) 
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using the Jacobi identity. Remembering that 

A^r [<f>n,<f>kl - 8nh 
[F(q,p),6 nl ] = 0, 

=>[F, A^][0„, fa] = -[F, [0 B , <f> k ]]A~ kl l , (A12) 
where one uses the Jacobi identity while passing to the last line. Then one obtains: 

- [/, m ]A"^ [h, <f>,][g, [<p k , <p n ]] - [g, <f> m ]A m lAu [/> 4>i\[h, [<h, <M\- 
[KM^^u\g,W,[^kM = -[f,<t>k\[h,<Pi][g,Au] - [g,ti\[f,mK*u'\ 
-[h,4>k][g,4>i\[f,t*il (A13) 

Notice that when /, g, h are permuted cyclically in the third term of (1A9I) and summed, the resulting expres- 
sion cancels the above expression. 

We are left with the seventh term of ( IA9I ). Permuting /, g, h cyclically and summing the three terms, one 
obtains: 

[J, m ]A~J,[0 n , Ajy ][g, <p k ][<f>i, h] + [g, (f> m ]A m l[4> n , A k f][h, faMu f] 
+[h, <p m ]A m ),[<f> n , A~ k l][f, <p k ][<Pl, g] 

-If, <Pm\Vg, <Pk\{<Pu h^A^X^n, A k j] + AJ^fa, A^\ + A^[0„,A^]J. (A14) 

Now, remembering that [F(q, p), A^][0 fc , a ] = -[F{q, p), [<p k , a ]]A"J., one gets: 

A- a l [F(q, p), A m >.][<^, <Pa\ = [F{q, p), A~\] = -[F(q, p), [fa, M^A'J . (A15) 

Then, putting F(q, p) = cf> n : 

A^[fa, A,/] + A,, 1 [0 n , A^] + A,' [0„, A^ 1 ] 
= - [0,„ 0«, 6 ]]A^ A,/ A^ - [0„, [0 fl , ^IJA^A,, 1 A,, 1 - [<f> n , [<p a , fc ]]A^A^A^. (A16) 

Since Jacobi identity applies to Poisson brackets, one can write the equivalent of [<p n , [(p a ,<pb]] using the 
identity. But for [(p n , [<p a , <pb]], applying the Jacobi identity is equivalent to permuting n,a, b in a cyclic 
order. So, using the Jacobi identity for [<p n , [<p a , fa]] in one of the terms on the left hand side in this manner, 
one sees that the expression equals to zero. This completes the proof of ( |49l ). 
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3. Dirac Procedure with Geometrical Constraint 

In order to make calculations one needs to calculate the Dirac brackets first. Using the usual transition 
to Hamiltonian dynamics, one obtains: 

H = T-p a p a + V(x) - Af(x). (A17) 
2m 

The procedure requires adding the primary constraints to this Hamiltonian and defining a new Hamiltonian 

H*; 

H* = H + up A , (A18) 

where u is some multiplier. Then, one needs to find the consistency conditions by calculating Poisson 
brackets of the primary constraints with this H* | 1 , (J : 

01 = PA * 0, (A19) 

02 = [<f>i,H*] = [ PA ,H*] = = f{x) * 0, (A20) 

OA 

03 = [02, H*] = [f(x), H*] = d a f{x)^- = p a d a f{x) * 0, (A21) 

OPa 

4 = [03, H*] = [p a d a f(x), H*] = p a p b d a d b f{x) - 8 b (v(x) - Af(x))d h f(x) * 0. (A22) 

The procedure terminates here since the last constraint includes A and calculating the Poisson bracket of 
this with H* and (weakly) equating to zero will bring a condition on u. Then, there are four constraints in 
total. Now, Poisson brackets of the constraints with each other are to be calculated. Nonzero brackets are: 

[01,04] = -^/^), (A23) 

[02,03] = d b d*f(x), (A24) 

[02, 4 ] = p%(d c f(x)d c f(xj), (A25) 
[03,04] = 2p a p c (d b d a f{x))(d h d c f(x)) - d a f(x)d a (p a p h d a d h f(x) 

- d b (V(x) - Af(x))d b f{x)j. (A26) 
Like Ogawa et. al, let us define: 

[02, 3 ] = a, [0 2 , 4 ] = -0, [04, 03] = 7- (A27) 
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Then, one constructs the following matrix and calculates its inverse to find the Dirac brackets 

-a 



ill 



[A w ] = 



a -p 
-a -y 



with the inverse: 



[Ah 1 ] = 



a 


P y 








_7_ 


£ 


a- 




y_ 





1 


a- 


a 




1 





\i 


a 








a 







(A28) 



(A29) 



Now, employing the definiton ([601) of Dirac brackets, one obtains the following fundamental bracket rela- 
tions |(]]: 



[x a ,p b ] D = 6 a b -n a n b , 
[x a ,x b ] D = 0, 

[p a ,Pb]D = p c {n h d c n a - n a d c n b ). 



(A30) 
(A31) 
(A32) 



Notice that Ogawa et.al.'s notation for Dirac brackets is used again. The classical (Lagrangian) equations 
of motion are: 



x a = -d a V(x) + Ad a f(x), 
f{x) = 0. 



(A33) 
(A34) 



dL d dL 

Notice that the second equation is obtained from (— ) = 0. Since Dirac brackets are denned, all 

oA at dX 

constraint equations can now be treated as strong equations. Then, the Hamiltonian equations of motion in 
terms of Dirac brackets read: 



x a = [x a ,H] D = p b (6" b -n a n h ) = p«, 



(A35) 



Pa = [p a , H] D = p b p c (n b d c n a - n a d c n b ) + (n a n d - 6 d a )d d V(x) - (n a n d - 6 d a )d d V(x), (A36) 



x a = [x a ,H] D = [p a ,H] D = (n a n d - 5 d a )d d V{x), 



(A37) 



using p a n a - 0. To see the equivalance of those equations, one can solve <p4 = for A (using p"d b d a f = 
d b (p"d a f) = 0, since p a n a = from <pi = 0, and the definition of n a ) and get: 

d b v(d h f) 



A = 



d c f(d c f) ' 



x a = -d a V(x) + Ad a f(x) = -d a V(x) + n a n h d b f{x). 



(A38) 
(A39) 
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After the coordinate transformation, the brackets become (at the classical level): 

[<A/>vb = ^L*°, QfPbk) = & v - n»n v , (A41) 
dx" dx b 

[p M ,P V ]D = [-^Pa,-^Pb]D 

dx cl dx* dx a dx b dx b dx a 

= W [Pa ' W ]DPb + [ W' Pb]DPa W + WW lPa,Pb] = (M2) 

To see that [p M ,p v ]D - 0, first notice that: 

[F(q), Pi Ad = d^Fiq^q^PfA - d^F{q)[(f, <f> k ] A kI l [0,, p v ] = d^F(q)[(f ,p v ] D , (A43) 

dx bK dq»> dx b dq^dq»> 

Using (IA44I) . the first and second terms cancel each other. In the third term, [p a , pb\r, is antisymmetric in a 
and b, but the coordinate transformation matrix elements are symmetric, so this term is zero, meaning that 
the whole expression equals to zero. 

At this point quantization is performed; that is, Dirac brackets, satisfying the denning properties and 
(IA43I ). are replaced by -ijfi times commutators. Making use of the coordinate transformation hypothesis of 
Ogawaef. ai, one obtains (using p a n a — > {p a ,n a })\ 



[q»,q v ] = 0, (A45) 
dx" 1,3^ „ „ dx a 

W Pa)] = l(w lq ' Pa] + l(f ' Pa] W 

dq v dx b 



„ dx" \/dx a „ „ dx", 

= lTZin;tf,P*\ = V y ~ «V) (A46) 



dx a dx b 

[P^Pv\ = V{^Pa}A-^,Pb\] 

l(dx b dx a dx" ( dx b dx b x / dx b 

= iw v W Pb]Pa + w (lPa ' w ]Pb + w [Pa ' Pb] > + ([Pc " w ]Pb 

dx b sdx° dx b dx a dx"' dx b dx" , dx b 

+ W [Pa > Ph] !w +Pa W [ W' Pb] + V W Pb] W P " + dq-^ [Pa ' Pb] W 

r dx b A / dx b r dx h A dx a dx" dx b \ n flAen 

Within the last equation, there are some obvious cancellations. Using (IA44I ). the first term cancels the 
second, and the eleventh term cancels the twelfth. Since [p a ,Pb] is antisymmetric in a and b, but the 
coordinate transformation matrix elements are symmetric, third and tenth terms vanish. Then, one realizes 
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that: 

dx h dx a dx b dx"' 

[Pa 'W ]Pb W +Pa W [ W' Pb] 

dx h dx" dx b dx" dx b dx a dx" dx b 

=[Pa ' W ]lPb > W ] + [Pa > W ]l W' Pb] + lPa ' W ] W Pb + Pb W lPa ' W l (A48) 

Notice that the first and second terms on the right hand side of (IA48I) cancel each other, while the third and 
fourth terms cancel the seventh and sixth terms of (IA47I) (using (IA44I )) respectively. Then, what is left is 
the following: 



dx b dx"' dx" dx b 

W [Pa ' Pb] W + W [Pa ' Pb] W 



[Pfi, Pv] = —[Pa, Pb]— + —[Pa, Pb\ — • (A49) 



One immediately observes that the left hand side is antisymmetric in jj and v, while the right hand side is 
symmetric. The only possibility is then [p M ,p v ] = 0. 

What is rather interesting about this procedure is that q° and po commute with every other operator, 
indicating that uncertainty relations are violated in the normal direction. 

It is now time to derive the Hamiltonian found in 
brackets are defined, and V(x) simply becomes V(q) after a general coordinate transformation, one has: 



fl.- 1 



Remembering that H = — p a p" + V(x) after Dirac 



H ^ l 2 6ab[d £' p ^ {d £' pv]+v{q) ' (A50) 

H-V(q) = K, (A51) 
v X Av d # M +0 5t ?\ dq v d<f dq> d?dq> \ 

K = s 6 l 2[ a?'^a?^ + 2p ^ lp - d~? ] + [ w> p » ][Pv > d~? ] + 4p ^w Pv ) 

1 ,, 1 ,J dcf dq v dqV dq v \ 1 nh dq^ dq v 

Let us first concentrate on the following expression: 

[g- V4 ,Pi] = -lg- 5/4 djg[q j ,Pil (A53) 
4 

djg = gg ik g ikJ , (A54) 

rij = \g ik gik, h (A55) 

^- 1/4 .Wl = -fr 1/4 lt«. (A56) 
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where g = det(gij). Now, consider the fourth term of (1A52I) : 
q° Lt— ,P/AVPv,-r— J 

ZdqvdqP 6 dq^dx?" u M dq^dx b " 

<A58) 

dq a dq A ^dx a > dx a dq A ^dq a > Aa 

Now, making use of the above results and the strong equations g° = 0, />o = 0, one obtains: 

H - y- y4 Pig 1/2 8 tj Pjg- m + |A; o r; o + V(q). (A61) 



4. Calculations for Ikegami et. al. 's Work 

In this treatment, the Lagrangian is (|78T ). Momentum variables then appear to be 

p a = x a - Ad a f(x), (A62) 

and the primary constraint is again: 

0i = * 0. (A63) 

Then, 

= ^ fl V + Ad a f(x))(p b + Ad h f(x)) + V(x) + up A . (A64) 

There is one secondary constraint in this case: 

4>2 = VP a, H*] = d a f(x)(p a - Ad a f(x)) * 0. (A65) 

Notice that we are still at the classical level, so the brackets are ordinary Poisson brackets. <p\ has zero 
Poisson brackets with both x cl and pb, and the matrix A.^ used in the definition of Dirac brackets is an- 
tisymmetric. This means, the term involving in the definition of Dirac bracket is zero for both; the 
fundamental brackets are then those obtained from ordinary Poisson brackets: 

[x a , x b ] D = [p a , p% = 0, [x a , p b ] D = 5" h - (A66) 



37 



Now, solving 02 = for X, one obtains: 



{dgf{x)) P a 

d b f(x)(d h f(x)Y 
This gives the following classical Hamiltonian iQ] : 

H = -p a (6 ab - n a n b )p b + V(x). 

Let us now concentrate on T n . 

T„ =-\An a p a n b p b + Bp a n a p b n b + CP a n a n b p h + Dn a p aPb n b + 



Ep a n b p b n a + Fn a p b n b p a + Gp a p h n a n b + Hn a n b p a p h 



=(A + B + C + D + E + F+G + H)p a n a n b p b + Bp a n a [p b , n b ] 
+A[n a ,p a ]n b p b + F[n a ,p b ]n b p a + Ep a n b [ Pb ,n a ] + D[n a , Pa ] Pb n b 
+Dp a n a [p b , n b ] + G Pa [Pb, n b ]n b + Gp a n a [p b , n b ] + Hn"[n b , p a ]p b 
+H[n a ,p a ]n b p b . 



(A67) 



(A68) 



(A69) 



Now, let us try to write the geometric potential of da Costa in terms of the commutators of n a and p a 
(assuming the q 3 — > limit of the metric given in |^, 5]): 



dx a M dx a dqJ dx a 



(A70) 



Notice that we used the definition of the Weingarten matrix. Now, one should notice that don" corresponds 
to the change in the Cartesian components of n along the direction of n. Since n is by definition a unit 
vector, in the coordinates that are chosen, dgii = 0. Then: 



d a n a = aj' = Tr a. 



{All) 



Using a similar reasoning, one notices that: 



dx a M dx b dx a dx b 3 



dq' jdx»dq* t dx a j 2 
— — — a; J — : — -rav — - = ai J ct\ =ir{a). 
dx" dqJ dx b dql 1 



So, one searches for: 



(A72) 



V s = (2Tr(a 2 ) - (Tr a) 2 ) = (2[p a ,n b ][n a , p b ] - [p a ,n a ][n b , p b ]) . (A73) 
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Hermiticity requires (I88T ). Using these conditions, one has: 

2T n =(2Re(A) + C + D + 2Re(E) + 2Re{G))p a n a n h p b + [n a , p a ]((A + G*)n b p b + Dp b n b ) 
+{A* +D + G)p a n a [p b ,n b ] + (E + G)p a n b [p b ,n a ] 

+(E* + G*)[[n a , p b ],n b p a ] + (E* + G*)n b p a [n a , p b \ (A74) 
First, notice that: 

[[n a , p b ], n b Pa ] = n b [[n a , p b ],p a ] (A75) 
since n b is only a function of coordinates and so is [n a ,p b ]. Then: 

[[n a , p b ], n b p a ] = -n b [[p a , n a ], p b ] (A76) 
since [p a , p b ] = 0. Notice here the following: 

n b [[p a , n a ], p b ] = fi 2 n b d b Tr a = rfn^d^Tr a = n 2 n°d Q Tr a = 0, (A77) 
since Tr a is an object which is evaluated at q 3 = 0. Now imposing 

A + G* = -D, E + G - E* + G* , 2Re{A) + C + D + 2Re(E) + 2Re(G) = 1, (A78) 

one obtains 

T„ - \[p a n a n b p h + h 2 (D(Tr a) 2 - (E + G)Tr(a 2 )j. (A79) 

This result implies that there are two purely arbitrary constants, D and E + G in this approach, so one has to 
impose some other conditions on the complex numbers to fix the ordering. Including the number of purely 
arbitrary coefficients, this is the result obtained by Ikegami et. al. 
Now, let us consider the ordering hypothesis of Ogawa et. al. 

T„ = l -{{n a ,p a }{n b ,p b }} 

I „ „o _ „b . _ „b „„<!.„„ „b„a 



Z 2 PaP ~ 4\~ p " n PbH + PaH PbU + PaPbn n 



+n a n b p a p b + n a p b n b p a - n b p b n a p a - 2n b p b p a n a I. (A80) 



Here one expects to find 



D = --, E + G = --. (A81) 
4 2 



Here, the corresponding results are: 

D = 1, E + G - -1, (A82) 
which mean that this ordering does not give the desired result. 
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